(19) 



(12) 



Europiisches Patentamt 
^WH European Patent Office 
" Office europeen des brevets EP 1 241 589 A2 

EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

18.09.2002 Bulletin 2002/38 



(21) Application number: 02004336.0 

(22) Date of filing: 01.03.2002 



(51) intci.': G06F 17/30, G06F 12/02 



(84) Designated Contracting States: 


(72) 


Inventors: 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 




Charkraborty, Krishnendu 


MC NLPTSETR 




San Mateo, California 94403 (US) 


Designated Extension States: 




Visvanathan, Jayashri 


AL LT LV MK RO SI 




Newark, California 94560 (US) 


(30) Priority: 01.03.2001 US 797630 


(74) 


Representative: HOFFMANN - EtTLE 




Patent- und RechtsanwMlte 


(71) Applicant: SUN MICROSYSTEMS, INC. 




Arabellastrasse 4 


Palo Alto, California 94303 (US) 




81925 Munchen (DE) 



Method and apparatus for freeing memory from an extensible markup language document 
object model tree active in an application cache 



(57) The present invention relates to a garbage col- 
lector that uses an LRU algorithm to free memory from 
an XML DOM tree active in an application cache. Ac- 
cording to one or more embodiments of the present in- 
vention, a threshold for the amount of memory permitted 
to reside in an application cache is set. Then, a garbage 
collector removes entries from the cache until it falls be- 
low the threshold. In one or more embodiments, a node 
table is used. When nodes are added to the XML DOM 
tree in the application cache the node table is updated. 



When the threshold f orthe amount of memory permitted 
to reside in the application cache is exceeded, the gar- 
bage collector applies an LRU algorithm uses the node 
table to determine which nodes to remove from the ap- 
plication cache. In one embodiment, the LRU algorithm 
scans the node table to determine the least recently 
used node in the table by examining time stamp entries 
in the table. Then, the algorithm removes that node and 
repeats the process until the XML DOM tree uses less 
memory in the cache than the threshold. 
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BACKGRO UND OF THE INVENTION 
1 ■ FIELD OF THE INVENTION 
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[0009] Properties are characteristics of an object; for example, the document object possesses a bgColour properry 
which reflects the background colour of the page. Using a programming language (e.g., JavaScript) one may, via this 
property, read or modify the colour of the current page. Some objects contain very many propert.es, some contain very 
few Some properties are read-only while others can be modified, possibly resulting in immediate on-screen results. 
[0010] A method typically executes an action which somehow acts upon the object by which it is owned. Sometimes 
the method also returns a result value. Methods are triggered by the programming language be.ng used, such as 
JavaScript. For example, the window object possesses a method named alert( ). When supplied wrth string data, the 
alert( ) method causes a window to pop up on the screen containing the data as its message; (e.g., alertf Invalid 
entry!")). 

[001 1 ] An event is used to trap actions related to its owning object. Typically, these actions are caused by the user. 
For example/when the user clicks on a submit button, this is a click event which occurs at the submit object. By virtue 
of submitting a form, a submit event is also- generated, following the click event. Although these events occur trans- 
parently one can choose to intercept them and trigger specified program code to execute. 

APPLICATION CACHE 

[0012] Since each component in the DOM is stored in memory, the DOM quickly becomes memory intensive^ In 
particular, the DOM typically forms a DOM tree which is stored in an area of memory called an application cache. The 
cache saves copies of web pages, images, and files (i.e., objects). Then, if there is another requestforthe same object, 
it will use the copy that it has, instead of asking the serverfor it again. There are two main reasons that caches are used: 

• To reduce latency - Because the request is satisfied from the cache (which is closer to the client) instead of the 
server, it takes less time for the client to get the object and display it. This makes web sites seem more responsive. 

. To reduce traffic - Because each object is only retrieved from the server once, it reduces the amount of bandwidth 
used by a client. This saves money if the client is paying by traffic, and keeps their bandwidth requirements lower 
and more manageable. 

[001 3] However, the cache is limited in size. Due to the large amount of data used by the DOM when it creates its 
trees the application cache quickly fills up. Currently, there is no way to free the cache of unnecessary, unneeded, or 
non-critical DOM trees. Hogging memory in the application cache has inhibited real time applications from widespread 
use of the XML DOM. 

SUMMARY OF THE INVENTION 

[0014] It is an object of the invention to provide an XML DOM tree with reduced memory requirements. 
[0015] This object of the invention is solved by a method for freeing memory from an XML DOM tree in a cache 
comprising storing one or more identifiers in a node table which correspond to each of one or more nodes of said XML 
DOM tree- scanning said node table to locate a least recently used node using said identifiers; and removing said 
identifiers and said least recently used node, if said XML DOM tree occupies more memory then a threshold. 
[001 6] Advantageously, one of said identifiers may comprise a time stamp entry associated with each of said nodes. 
[0017] Further, the scanning may comprise examining said time stamp entry to find said least recently used node, 
and may comprise determining whether said least recently used node has a child node; and modifying said time stamp 
associated with said least recently used node. 

[001 8] The modifying operation may further comprise changing said time stamp to a value of a most recently used 
child plus a millisecond. 

[0019] Still further, the scanning may comprise: determining whether a node is opened in multiple sessions by an 
identical user- selecting a most recently used copy of said node; placing one or more second identif lers for said node 
in an intermediate data structure; choosing a least recently used node in said data structure using said second iden- 
tifiers- removing said second identifiers from said node table. 

[0020] The object of the invention is further solved by a computer program product comprising a computer usable 
medium having computer readable program code embodied therein configured to free memory from an XML DOM tree 
in a cache, said computer program product comprising: computer readable code configured to cause a computer to 
store one or more identifiers for each of one or more nodes in said XML DOM tree in a node table; computer readable 
code configured to cause acomputerto scan said identifiers to locate a least recently used node; and computer readable 
code configured to cause a computer to remove said identifiers and said least recently used node, if said XML DOM 
tree occupies more memory then a threshold. 

[0021] The object of the invention is further solved by a garbage collector for freeing memory from an XML DOM 
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tree in a cache, said garbage collector comprising: a threshold for a maximum amount of ^^[^^ 
said cache; a node table configured to store one or more identifiers which correspond to each of one or more , nodes 
of said XML DOM tree- a scanner for examining said identifiers to locate a least recently used node, and a garbage 
conel™r removing said identifiers and said least recently used node, if said XML DOM tree occupies more memory 

ST'^ISin invention relates to an a.gohthm to free memory from an XML DOM tree 

the application cache the node table is updated. When the threshold for the amount of ^mory perm J *d *o 
the application cache is exceeded, an LRU algorithm applied by the garbage collector uses the node table to determine 
which nodes to remove from the application cache. rfr ^ Qr ,ti» nndP in the table 

[0024] in one embodiment, the algorithm scans the node table to determine the least recently used node n the tab j 

[0025] If the same user has opened the same XML node in multiple sessions, mu p reDe ated nodes 

will exlstforthe same user in the node table. In this situation, the most recently used ^^^^^^ 
becomes the time stamp for all of those nodes. To decide whether to remove this type of node ™° ^ 6 ^ ™ 
the XML garbage collector creates an intermediate data structure. The data "^"S ^hos pe ated 

node. The least recently used of all entries in the intermediate data structure is chosen and then, all of those repeated 
entries in the node table are removed. 
BRIEF DESCRIPTION OF THE DRAWINGS 

[0026] These and other features, aspects and advantages of the present invention will become better understood 
with regard to the following description, appended claims and accompanying draw.ngs where. 
Fig. 1 is a flowchart showing the operation of an XML garbage collector according to an embodiment of the present 
invention. 

Fig. 2 is a flowchart showing the operation of a garbage collector according to another embodiment of the present 
invention. 

Fig. 3 is a diagram of a node table according to an embodiment of the present invention. 

Fig. 4 is a flowchart showing the operation of a garbage coliector according to another embodiment of the present 
invention. 

Fig. 5 is a flowchart showing the operation of a garbage collector according to another embodiment of the present 
invention. 

Fig. 6 is a diagram of an XML DOM tree resident In an application cache. 

Fig. 7 is a diagram of a node table that might be used by an embodiment of the present invention. 
. Fig B is a diagram of a node table that might be used by an embodiment of the present invention. 

Fig. 9 is an embodiment of a computer execution environment where one or more embodiments of the present 
invention may be implemented. 

; DETAILED DESCRIPTION OF THE INVENTION 

rOD271 The oresent invention relates to a garbage collector that applies an algorithm to free memory from an XML 
DO Jtree "action appiSion cache. In tne foi.owing description, numerous specific details are set forth to provide 
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a more thorough description of embodiments of the invention. It will be apparent, however, to one skilled in the art, 
that the invention may be practiced without these specific details. In other instances, well known features have not 
been described in detail so as not to obscure the invention. 

5 XML GARBAGE COLLECTOR 

[0028] One embodiment of the present invention is shown in Figure 1 . In this embodiment, a threshold for the amount 
of memory permitted to reside in an application cache is set at step 100. Then, it is determined at step 110 whether 
the amount of memory used by the application cache exceeds the threshold. If it does not, then the process repeats 
10 at step 110. When the used memory in the cache exceeds the threshold, an XML garbage collector removes the least 
recently used entry from the cache at step 120. Then, the process repeats at step 110. 

NODE TABLE 

[0029] In one embodiment, a node table is used. When nodes are added to the XML DOM tree in the application 
cache the node table is updated. Addition of nodes happens in an independent thread. When the threshold for the 
amount Df memory permitted to reside in the application cache is exceeded, an LRU algorithm applied by the garbage 
collector uses the node table to determine which nodes to remove from the application cache. This occurs when the 
garbage collector is kicked off. II the garbage collector is kicked off too frequently overburdens the CPU. A garbage 
collector that is kicked off too seldom will not remove nodes frequently enough from the node table. 
[0030] In one embodiment of the present invention, the garbage collector is instantiated in its own thread that is a 
light weighted process. Threads typically work by sharing resources. So, they usually sleep, followed by an interval of 
activity, followed by an interval of sleep. The garbage collector thread acts in this manner. The period of time that the 
garbage collector sleeps may be chosen by the system administrator or it may take a default value. 
[0031] When the garbage collector thread wakes up, it checks to see if the memory required by the application is 
above a threshold. If so, ft starts cleaning up entries from the node table and the DOM cache until the memory falls 
below the stipulated limit. Once the memory goes below the limit, it sleeps. If the thread wakes up again and finds that 
the memory is below the threshold still, it goes back to sleep for a specific number of milliseconds again. 
[0032] This embodiment of the present invention is shown in Figure 2. First, a threshold for the amount of memory 
permitted to reside in an application cache is set at step 200. Then, at step 21 0 the garbage collector thread is awak- 
ened. At step 220 the memory usage is determined. Next, at step 240, ft is determined whether the amount of memory 
used by the application cache exceeds the threshold. If it does not, the garbage collector is put to sleep at step 245 
and the system waits for a specified number of milliseconds at step 250 before repeating step 210. 
[0033] When the used memory in the cache exceeds the threshold at step 240, the XML garbage collector uses an 
LRU algorithm at step 260 to scan the node table to find the LRU node. Once the LRU node is found, it is removed 
from the cache at step 270 and the process repeats at step 220. 

[0034] In one embodiment, the node table has entries for a nodelD, a sessionID, a user name, a time stamp, and a 
node path. An example of such a node table is shown in Figure 3. In Figure 3, it is seen that this embodiment of the 
node table has 5 columns with entries for nodelD, sessionID, user name, time stamp, and node path. The example 
shown in Figure 3 is for three users, John, Bill, and Jack and includes the complete paths for their nodes and the times 
they were entered into the XML DOM tree, as well as nodelDs and sessionlDs for those nodes. 

CHILD NODES 

is [0035] In one embodiment, the LRU algorithm scans a node table (the node table of Figure 3, for instance) to deter- 
mine the LRU node in the table by examining the time stamp entries in the table. Then, the algorithm removes that 
node and repeats the process until the XML DOM tree is smaller than the threshold. If the least recently used node 
has a child node opened by the same user, as indicated by the node path entry in the node table, it is not closed. 
Instead, the node that could not be closed has its time stamp modified to the value of the time stamp for its most 

so recently used child plus one millisecond. 

[0036] This embodiment of the present invention is shown in Figure 4. First, a threshold for the amount of memory 
permitted to reside in an application cache is set at step 400. Then, at step 410 the garbage collector thread is awak- 
ened. At step 420 the memory usage is determined. Next, at step 440, it is determined whether the amount of memory 
used by the application cache exceeds the threshold. If it does not, the garbage collector is put to sleep at step 445 

55 and the system waits for a specified number of milliseconds at step 450 before repeating step 420. 

[0037] When the used memory in the cache exceeds the threshold at step 440, the XML garbage collector uses an 
LRU algorithm at step 460 to scan the timestamp entries in the node table to find the LRU node. Once the LRU node 
is found, it is determined if that node has a child node opened by the same user at step 470 If not, it is removed from 
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USE CASE EXAMPLES 



Pass 3: 
Pass 4: 



hat th e e2. £Tt2 XML noM? ° Perati ° n ° f " embodimer * Qf the p — * invention. Assume, for instance 
n^^^T^^S^r " " Sh ° Wn " R9Ure 8 GiVSn the D ° M treS " F ^ 6. ^en the 

r0044 Git , !■ ? 9 9 C °" eCt0r St8rted rUnning mi9ht be arran S ed as show ^ ^ Figure 7 

^a^^S^fh arr h a, ; 9ement aS 15 shown in 7 aMurtno that the memory used by such an 

6XCeedS thS threSh ° ld m each pass of the collector through the node table, the following Sons 

PaSS 1 : ^nanoedto"^ r ! * 8 Child °' 5 ' ThBref ° re "° de 5 cannot be closed ' s ° «™ stamp for 5 

is changed to the timestamp of 6 4- 1 millisecond = 123122. 

Pass 2: Remove 6 since it is the LRU node. 

Remove 5 since it is the LRU node with the new time stamp of 123 122 and now has no children open. 
" S l' 2 i 4 remain ' 1 ,2 ' and 3 ' however ' belon 9 10 and are the same node opened in dffferent 

Pass 5: Close 1 ,2, and 3. 

l Z\ S L^r!llT^V 1 C L Perat, ° n ° f ^ embodiment ° f ««• present invention is shown with respect to the 

Pass 1 : Nodes 1 ,2, and 3 are the same node opened by John. 

Nodes 4,5, and 6 are the same nodes opened by Sam 

uLTt* 1 ' 2 ' T « ' n ° de 1 iS the m ° St reCent 'y used ° f 4,5, and 6, node 4 is the most recently 

Z Jt i < ■ 1 I and 4 P ' aCed " 3n ^ta structure and compared Since 4 is the LRU 

node ,n the ,ntermed,ate data structure, it is chosen and nodes 4,5, and 6 are removed from the ode taWe 
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Pass 2: Remove nodes 1 ,2, and 3. 

[0046] Pseudo-code that describes the operation of one embodiment of an XML garbage collector is shown in Ap- 
pendix A. 

s [0047] It is noted that a computer-readable medium may be provided having a program embodied thereon, where 
the program is to make a computer or a system of data processing devices to execute functions or operations of the 
features and elements of the above described examples. A computer-readable medium can be a magnetic or optical 
or other tangible medium on which a program is recorded, but can also be a signal, e.g. analog or digital, electronic, 
magnetic or optical, in which the program is embodied for transmission. Further, a computer program product may be 

10 provided comprising the computer-readable medium. 

[0048] — According^to-another-embodiment, a.program may be provided having instructions adapted to cause data 
processing means to carry out the operations of the above embodiments. Further, a computer readable medium may 
be provided in which the program is embodied. 

J5 EMBODIMENT OF COMPUTER EXECUTION ENVIRONMENT (HARDWARE) 

[0049] An embodiment of the invention can be implemented as computer software in the form of computer readable 
program code executed in a general purpose computing environment such as environment 900 Illustrated in Figure 9, 
or in the form of bytecode class files executable within a Java™ run time environment running in such an environment, 

20 or in the form of bytecodes running on a processor (or devices enabled to process bytecodes) existing in a distributed 
environment (e.g., one or more processors on a network). A keyboard 910 and mouse 911 are coupled to a system 
bus 91 8. The keyboard and mouse are for introducing user input to the computer system and communicating that user 
input to central processing unit (CPU) 913. Other suitable input devices may be used in addition to, or in place of, the 
mouse 911 and keyboard 910. I/O (input/output) unit 919 coupled to bi-directional system bus 918 represents such I/ 

25 o elements as a printer, AA/ (audio/video) I/O, etc. 

[0050] Computer 901 may include a communication interface 920 coupled to bus 91 B. Communication interface 920 
provides a two-way data communication coupling via a network link 921 to a local network 922. For example, If com- 
munication interface 920 is an integrated services digital network (ISDN) card or a modem, communication interface 
920 provides a data communication connection to the corresponding type of telephone line, which comprises part of 

30 network link 921 . If communication interface 920 is a local area network (LAN) card, communication interface 920 
provides a data communication connection via network link 921 to a compatible LAN. Wireless links are also possible. 
In any such implementation, communication interface 920 sends and receives electrical, electromagnetic or optical 
signals which carry digital data streams representing various types of information. 

[0051] Network link 921 typically provides data communication through one or more networks to other data devices. 

35 For example, network link 921 may provide a connection through local network 922 to host computer 923 or to data 
equipment operated by ISP 924. ISP 924 in turn provides data communication services through.the world wide packet 
data communication network now commonly referred to as the "Internet" 925. Local network 922 and Internet 925 both 
use electrical, electromagnetic or optical signals which carry digital data streams. The signals through the various 
networks and the signals on network link 921 and through communication interface 920, which carry the digital data 

40 to and from computer 900, are exemplary forms of carrier waves transporting the information. 

[0052] Processor 913 may reside wholly on client computer 901 or wholly on server 926 or processor 913 may have 
its computational power distributed between computer 901 and server 926. Server 926 symbolically is represented in 
Figure 9 as one unit, but server 926 can also be distributed between multiple "tiers". In one embodiment, server 926 
comprises a middle and back tier where application logic executes in the middle tier and persistent data is obtained in 

45 the back tier. In the case where processor 913 resides wholly on server 926, the results of the computationsperformed 
by processor 91 3 are transmitted to computer 901 via Internet 925, Internet Service Provider (ISP) 924, local network 
922 and communication interlace 920. In this way, computer 901 is able to display the results of the computation to a 
user in the form of output. 

[0053] Computer 901 includes a video memory 914, main memory 915 and mass storage 912, all coupled to bi- 
so directional system bus 918 along with keyboard 910, mouse 911 and processor 91 3. As with processor 913, in various 
computing environments, main memory 915 and mass storage 912, can reside wholly on server 926 or computer 901 , 
or they may be distributed between the two. Examples of systems where processor 913, main memory 91 5, and mass 
storage 912 are distributed between computer 901 and server 926 include the thin-client computing architecture de- 
veloped by Sun Microsystems, Inc., the palm pilot computing device and other personal digital assisiants, Internet 
55 ready cellular phones and other Internet computing devices, and in platform independent computing environments, 
such as those which utilize the Java technologies also developed by Sun Microsystems, Inc. XML DOM trees and 
identifiers for the nodes in the DOM trees may be stored in main memory 915 with a cache 990. Objects removed from 
the cache may be stored in an area 995 of mass storage 912. 
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a 32-bit data bus for transferring data ^ ™ j ™Kiplex data/address lines may be used ,nstead 

915, video memory 9U and mass storage 912. Atterna 
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(57) The present invention relates to a garbage col- 
lector that uses an LRU algorithm to free memory from 
an XML DOM tree active in an application cache. Ac- 
cording to one or more embodiments of the present in- 
vention, a threshold forthe amount of memory permitted 
to reside in an application cache is set. Then, a garbage 
collector removes entries from the cache until ft falls be- 
low the threshold. In one or more embodiments, a node 
table is used. When nodes are added to the XML DOM 
tree in the application cache the node table is updated. 



When the threshold forthe amount of memory permitted 
to reside in the application cache is exceeded, the gar- 
bage collector applies an LRU algorithm uses the node 
table to determine which nodes to remove from the ap- 
plication cache. In one embodiment, the LRU algorithm 
scans the node table to determine the least recently 
used node in the table by examining time stamp entries 
in the table. Then, the algorithm removes that node and 
repeats the process until the XML DOM tree uses less 
memory in the cache than the threshold. 
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Description 

BACKGROUND OF THE INVENTION 
5 1 . FIELD OF THE INVENTION 



[0001] The present invention relates to freeing memory from a 

uses a least recently used (LRU) algorithm to free memory from an extensible markup language (XML) 



2. BACKGROUND ART 

TO. Interna. * dnving an unprecedented demand -or ecc.ss ,0 i— n = , ™^ \Tb 
In.ormallon is presen.ad to a user is through . Braph.ca, uaar n »"~ ^ te » coloura . L Mricr gate, 

dale in tha proper formal, Ihe wee browser displays torroallod u». P l01ur * E ' "° u ™ 0 .,„„„.„ ,htmu was onglnally 
To in.truat a web browse, ,o praa.ol me data in the deair.d manner, hypertext ™«^>"^*Z™^L 5 K the 
used HTML is a language whereby a lile i. created that ha, tb. n.oe.a.ry data and also mform.t.on 9 

;Zr™e,e,,h.s recently amazed 

trees, an overview of the Internet i8 provided below, 
INTERNET 

10005] Tha interna. Is a network connecting many computer network, and* based on a c J™^ d »« s 'r™ 
and communica.ionspro.oco, calledTCP/,P (Transmission Control ^"^^^^ t^ST^L 

rrren«==diL^ 

;zr?'r: s ebTa=wr„=ma,b^ = ^ 

and transmit documents (i.e. , web pages) to other computers on the networK wnerv « ' retrieved 

eS,=^- 
an online address called a Uniform Resource Locator (URL). 

XML DOM 

and methods relevant to the submit button in a form. 
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[0009] Properties are characteristics of an object; for example, the document object possesses a bgColour propeny 
which reflects the background colour of the page. Using a programming language (e.g., JavaScript) one may, vie this 
property, read or modify the colour of the current page. Some objects contain very many properties, some contain very 
few Some properties are read-only while others can be modified, possibly resulting in immediate on-screen results. 
[0010] A method typically executes an action which somehow acts upon the object by which It is owned. Sometimes 
the method also returns a result value. Methods are triggered by the programming language being used, such as 
JavaScript. For example, the window object possesses a method named alert( ). When supplied with string data, the 
alert( ) method causes a window to pop up on the screen containing the data as its message; (e.g., alertflnvalid 
entry!")). 

[001 1 ] An event is used to trap actions related to its owning object. Typically, these actions are caused by the user. 
For example, when the user clicks on a submit button, this is a click event which occurs at the submit object. By virtue 
of submitting a form, a submit event is also generated, following the click event. Although these events occur trans- 
parently, one can choose to intercept them and trigger specified program code to execute. 

APPLICATION CACHE 

[0012] Since each component In the DOM is stored in memory, the DOM quickly becomes memory intensive. In 
particular, the DOM typically forms a DOM tree which is stored in an area of memory called an application cache. The 
cache saves copies of web pages, images, and files (i.e., objects). Then, if there is another request for the same object, 
it will use the copy that it has, instead of asking the serverfor it again. There are two main reasons that caches are used: 

. To reduce latency - Because the request is satisfied from the cache (which is closer to the client) instead of the 
server, it takes less time for the client to get the object and display it. This makes web sites seem more responsive. 

• To reduce traffic - Because each object is only retrieved from the server once, it reduces the amount of bandwidth 
used by a client. This saves money if the client is paying by traffic, and keeps their bandwidth requ.rements lower 
and more manageable. 

[0013] However, the cache is limited in size. Due to the large amount of data used by the DOM when it creates its 
trees the application cache quickly fills up. Currently, there is no way to free the cache of unnecessary, unneeded, or 
non-critical DOM trees. Hogging memory in the application cache has inhibited real time applications from widespread 
use of the XML DOM. 

SUMMARY OF THE INVENTION 

[0014] It is an object of the invention to provide an XML DOM tree with reduced memory requirements. 
[0015] This object of the invention is solved by a method for freeing memory from an XML DOM tree in a cache 
comprising: storing one or more identifiers in a node table which correspond to each of one or more nodes of said XML 
DOM tree- scanning said node table to locate a least recently used node using said identifiers; and removing said 
identifiers and said least recently used node, if said XML DOM tree occupies more memory then a threshold. 
[0016] Advantageously, one of said identifiers may comprise a time stamp entry associated with each of said nodes. 
[0017] Further, the scanning may comprise examining said time stamp entry to find said least recently used node, 
and may comprise determining whether said least recently used node has a child node; and modifying said time stamp 
associated with said least recently used node. 

[001 8] The modifying operation may further comprise changing said time stamp to a value of a most recently used 
child plus a millisecond. 

[0019] Still further, the scanning may comprise: determining whether a node is opened in multiple sess.ons by an 
identical user; selecting a most recently used copy of said node; placing one or more second identifiers for said node 
in an intermediate data structure; choosing a least recently used node in said data structure using said second iden- 
tifiers; removing said second identifiers from said node table. 

[0020] The object of the invention is further solved by a computer program product comprising a computer usable 
medium having computer readable program code embodied therein configured to free memory from an XML DOM tree 
in a cache, said computer program product comprising: computer readable code configured to cause a computer to 
store one or more identifiers for each of one or more nodes in said XML DOM tree in a node table; computer readable 
code configured to cause acomputer to scan said identifiers to locate a least recently used node; and computer readable 
code configured to cause a computer to remove said identifiers and said least recently used node, if said XML DOM 
tree occupies more memory then a threshold. 

[0021] The object of the invention is further solved by a garbage collector for freeing memory from an XML DOM 
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tree in a cache, said garbage collector comprising: a threshold for a maximum amount of memory that may reside in 
said cache; a node table configured to store one or more identifiers which correspond to each of one or more nodes 
of said XML DOM tree; a scanner for examining said identifiers to locate a least recently used node: and a garbage 
collector for removing said identifiers and said least recently used node, if said XML DOM tree occupies more memory 
then said threshold. 

[0022] The present invention relates to an algorithm to free memory from an XML DOM tree active in an application 
cache. According to one or more embodiments of the present invention, a threshold forthe amount of memory permitted 
to reside in an application cache is set. Then, an XML garbage collector removes entries from the cache until it falls 
below the threshold. 

[0023] In one or more embodiments, a node table is used. One embodiment of the node table has entries for a 
nodelD, a sessionID, a user name, a time stamp, and a node path. When nodes are added to the XML DOM tree in 
the application cache the node table is updated. When the threshold for the amount of memory permitted to reside in 
the application cache is exceeded, an LRU algorithm applied by the garbage collector uses the node table to determine 
which nodes to remove from the application cache. 

[0024] In one embodiment, the algorithm scans the node table to determine the least recently used node in the table 
by examining the time stamp entries in the table. Then, the algorithm removes that node and repeats the process until 
the XML DOM tree is smaller than the threshold. If the least recently used node has a child node opened by the same 
user, as indicated by the node path entry in the node table, It is not closed. Instead, the node that could not be closed 
has its time stamp modified to the value of the time stamp for its most recently used child plus one millisecond. 
[0025] If the same user has opened the same XML node in multiple sessions, multiple entries for the same nodelD 
will exist forthe same user in the node table. In this situation, the most recently used time stamp forthe repeated nodes 
becomes the time stamp for all of those nodes. To decide whether to remove this type of node, one embodiment of 
the XML garbage collector creates an intermediate data structure. The data structure holds one entry for each repeated 
node. The least recently used of all entries in the intermediate data structure is chosen and then, all of those repeated 
entries in the node table are removed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0026] These and other features, aspects and advantages of the present invention will become better understood 
with regard to the following description, appended claims and accompanying drawings where: 

Fig. 1 is a flowchart showing the operation of an XML garbage collector according to an embodiment of the present 
invention. 

Fig. 2 is a flowchart showing the operation of a garbage collector according to another embodiment of the present 
invention. 

Fig. 3 is a diagram of a node table according to an embodiment of the present invention. 

Fig. 4 is a flowchart showing the operation of a garbage collector according to another embodiment of the present 
invention. 

Fig. 5 is a flowchart showing the operation of a garbage collector according to another embodiment of the present 
invention. 

Fig. 6 is a diagram of an XML DOM tree resident in an application cache. 

Fig. 7 is a diagram of a node table that might be used by an embodiment of the present invention. 

Fig 8 is a diagram of a node table that might be used by an embodiment of the present invention. 

Fig. 9 is an embodiment of a computer execution environment where one or more embodiments of the present 
invention may be implemented. 

DETAILED DESCRIPTION OF THE INVENTION 

[0027] The present invention relates to a garbage collector that applies an algorithm to free memory from an XML 
DOM tree active in an application cache. In the following description, numerous specific details are set forth to provide 
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a more thorough description of embodiments of the invention. It will be apparent, however, to one skilled in the art, 
that the invention may be practiced without these specific details. In other instances, well known features have not 
been described in detail so as not to obscure the invention. 

s XML GARBAGE COLLECTOR 

[0028] One embodiment of the present invention is shown in Figure 1 . In this embodiment, a thresholdforthe amount 
of memory permitted to reside in an application cache is set at step 100. Then, it is determined at step 110 whether 
the amount of memory used by the application cache exceeds the threshold. If it does not, then the process repeats 
10 at step 110. When the used memory in the cache exceeds the threshold, an XML garbage collector removes the least 
recently used entry from the cache at step 120. Then, the process repeats at step 110. 

NODE TABLE 

[0029] In one embodiment, a node table is used. When nodes are added to the XML DOM tree in the application 
cache the node table is updated. Addition of nodes happens in an independent thread. When the threshold for the 
amount of memory permitted to reside in the application cache is exceeded, an LRU algorithm applied by the garbage 
collector uses the node table to determine which nodes to remove from the application cache. This occurs when the 
garbage collector is kicked off. If the garbage collector is kicked off too frequently overburdens the CPU. A garbage 
collector that is kicked off too seldom will not remove nodes frequently enough from the node table. 
[0030] In one embodiment of the present invention, the garbage collector is instantiated in its own thread that is a 
light weighted process. Threads typically work by sharing resources. So, they usually sleep, followed by an interval of 
activity, followed by an interval of sleep. The garbage collector thread acts in this manner. The period of time that the 
garbage collector sleeps may be chosen by the system administrator or it may take a default value. 
[0031] When the garbage collector thread wakes up, it checks to see if the memory required by the application is 
above a threshold. If so, it starts cleaning up entries from the node table and the DOM cache until the memory falls 
below the stipulated limit. Once the memory goes below the limit, it sleeps. If the thread wakes up again and finds that 
the memory is below the threshold still, it goes back to sleep for a specific number of milliseconds again. 
[0032] This embodiment of the present invention is shown in Figure 2. First, a threshold for the amount of memory 
permitted to reside in an application cache is set at step 200. Then, at step 210 the garbage collector thread is awak- 
ened. At step 220 the memory usage is determined. Next, at step 240, it is determined whether the amount of memory 
used by the application cache exceeds the threshold. If it does not, the garbage collector is put to sleep at step 245 
and the system waits for a specified number of milliseconds at step 250 before repeating step 210. 
[0033] When the used memory in the cache exceeds the threshold at step 240, the XML garbage collector uses an 
LRU algorithm at step 260 to scan the node table to find the LRU node. Once the LRU node is found, it is removed 
from the cache at step 270 and the process repeats at step 220. 

[0034] In one embodiment, the node table has entries for a nodelD, a sessionID, a user name, a time stamp, and a 
node path. An example of such a node table is shown in Figure 3. In Figure 3, it is seen that this embodiment of the 
node table has 5 columns with entries for nodelD, sessionID, user name, time stamp, and node path. The example 
shown in Figure 3 is for three users, John, Bill, and Jack and includes the complete paths for their nodes and the times 
they were entered into the XML DOM tree, as well as nodelDs and sessionlDs for those nodes. 

CHILD NODES 

45 [0035] In one embodiment, the LRU algorithm scans a node table (the node table of Figure 3, for instance) to deter- 
mine the LRU node in the table by examining the time stamp entries in the table. Then, the algorithm removes that 
node and repeats the process until the XML DOM tree is smaller than the threshold. If the least recently used node 
has a child node opened by the same user, as indicated by the node path entry in the node table, it is not closed. 
Instead, the node that could not be closed has its time stamp modified to the value of the time stamp for its most 

so recently used child plus one millisecond. 

[0036] This embodiment of the present invention is shown in Figure 4. First, a threshold for the amount of memory 
permitted to reside in an application cache is set at step 400. Then, at step 41 0 the garbage collector thread is awak- 
ened. At step 420 the memory usage is determined. Next, at step 440, it is determined whether the amount of memory 
used by the application cache exceeds the threshold. If it does not, the garbage collector is put to sleep at step 445 

ss and the system waits for a specified number of milliseconds at step 450 before repeating step 420. 

[0037] When the used memory in the cache exceeds the threshold at step 440, the XML garbage collector uses an 
LRU algorithm at step 460 to scan the timestamp entries in the node table to find the LRU node. Once the LRU node 
is found, it is determined if that node has a child node opened by the same user at step 470 If not, it is removed from 
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the cache at step 480 and the process repeats at step 410. 

[0038] If, however, the LRU node identified at step 460 has a child node opened by the same user, as indicated by 
the node path entry in the node table, it is not closed. Instead, the node that could not be closed has its time stamp 
modified to the value of the time stamp for its most recently used child plus one millisecond at step 490 and the process 
repeats at step 460. 

MULTIPLE SESSIONS FOR THE SAME USER 

[0039] If the same user has opened the same XML node in multiple sessions, multiple entries for the same nodelD 
will exist f orthe same user in the node table. In this situation, the most recently used time stamp forthe repeated nodes 
becomes the time stamp for all ot those nodes. To decide whether to remove this type of node, one embodiment of 
the XML garbage collector creates an intermediate data structure. The intermediate data structure holds one entry for 
each repeated node. The least recently used of all entries in the intermediate data structure is chosen and then, all of 
those repeated entries in the node table are removed by the garbage collector. 

[0040] This embodiment of the present invention is shown in Figure 5. First, a threshold forthe amount of memory 
permitted to reside in an application cache is set at step 500. Then, at step 510 the garbage collector thread is awak- 
ened. At step 520 the memory usage is determined. Next, at step 540, it is determined whether the amount of memory 
used by the application cache exceeds the threshold. If It does not, the garbage collector is put to sleep at step 545 
and the system waits for a specified number of milliseconds at step 550 before repeating step 510. 
[0041] When the used memory in the cache exceeds the threshold at step 540, then at step 560 the most recently 
used node is chosen in each instance where the same user has opened the node in multiple sessions. Then, those 
most recently used nodes are placed in an intermediate data structure (such as an array) at step 570. 
[0042] Next, the garbage collector uses an LRU algorithm at step 580 to scan the intermediate data structure to find 
the LRU node. Once the LRU node is found, all entries in the node tabic that correspond to that node are removed at 
step 590 and the process repeats at step 520. 

USE CASE EXAMPLES 

[0043] The following is an example of the operation of an embodiment of the present invention. Assume, for instance, 
that the cache had the XML DOM tree in memory that is shown in Figure 6. Given the DOM tree of Figure 6, then the 
node table at the time the XML garbage collector started running might be arranged as shown in Figure 7. 
[0044] Given such a table arrangement as is shown in Figure 7 and assuming that the memory used by such an 
arrangement exceeds the threshold in each pass of the garbage collector through the node table, the following actions 
would occur: 

Pass 1 : Try to close node 5, but node 6 is a child of 5. Therefore node 5 cannot be closed, so the time stamp for 5 
is changed to the timestamp of 6 + 1 millisecond = 123122. 

Pass 2: Remove 6 since it is the LRU node. 

Pass 3: Remove 5 since it is the LRU node with the new time stamp of 123122 and now has no children open. 

Pass 4: Nodes 1 ,2,3, and 4 remain. 1 ,2, and 3, however, belong to John and are the same node opened in different 
sessions. The most recently used among 1,2, and 3 is picked (which is 1). Then, between 1 and 4, 4 is the 
LRU node, so it is removed. 

Pass 5: Close 1 ,2, and 3. 

[0045] Another example of the operation of an embodiment of the present invention is shown with respect to the 
node table arrangement shown in Figure 8. Assuming that the node table is arranged as shown in Figure 8 when the 
XML garbage collector begins running and assuming that on each pass of the node table, the cache memory exceeds 
the threshold, then the following actions would occur: 

Pass 1 : Nodes 1 ,2, and 3 are the same node opened by John. 

Nodes 4,5, and 6 are the same nodes opened by Sam. 

Of nodes 1 ,2, and 3, node 1 is the most recently used. Of nodes 4,5, and 6, node 4 is the most recently 
used. Thus, nodes 1 and 4 are placed in an intermediate data structure and compared. Since 4 is the LRU 
node in the intermediate data structure, it is chosen and nodes 4,5, and 6 are removed from the node table. 
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Pass 2: Remove nodes 1 ,2, and 3. 

[0046] Pseudo-code thai describes the operation of one embodiment of an XML garbage collector is shown in Ap- 
pendix A. 

5 [0047] It is noted that a computer-readable medium may be provided having a program embodied thereon, where 
the program is to make a computer or a system of data processing devices toexecute functions or operations of the 
features and elements of the above described examples. A computer-readable medium can be a magnetic or optical 
or other tangible medium on which a program is recorded, but can also be a signal, e.g. analog or digital, electronic, 
magnetic or optical, in which the program is embodied for transmission. Further, a computer program product may be 

10 provided comprising the computer-readable medium. 

[0048] According to another embodiment, a program may be provided having instructions adapted to cause data 
processing means to carry out the operations of the above embodiments. Further, a computer readable medium may 
be provided in which the program is embodied. 

J5 EMBODIMENT OF COMPUTER EXECUTION ENVIRONMENT (HARDWARE) 

[0049] An embodiment of the Invention can be implemented as computer software in the form of computer readable 
program code executed in a general purpose computing environment such as environment 900 Illustrated in Figure 9, 
or in the form of bytecode class files executable within a Java™ run time environment running in such an environment, 

20 or in the form of bytecodes running on a processor (or devices enabled to process bytecodes) existing in a distributed 
environment (e.g., one or more processors on a network). A keyboard 910 and mouse 911 are coupled to a system 
bus 918. The keyboard and mouse are for introducing user input to the computer system and communicating that user 
input to central processing unit (CPU) 91 3. Other suitable input devices may be used in addition to, or in place of, the 
mouse 911 and keyboard 91 0. I/O (input/output) unit 919 coupled to bi-directional system bus 918 represents such I/ 

25 O elements as a printer, AA/ (audio/video) I/O, etc. 

[0050] Computer 901 may include a communication interface 920 coupled to bus 918. Communication interface 920 
provides a two-way data communication coupling via a network link 921 to a local network 922. For example, if com- 
munication interface 920 is an integrated services digital network (ISDN) card or a modem, communication interface 
920 provides a data communication connection to the corresponding type of telephone line, which comprises part of 

30 network link 921 . If communication interface 920 is a local area network (LAN) card, communication interface 920 
provides a data communication connection via network link 921 to a compatible LAN. Wireless links are also possible. 
In any such implementation, communication interface 920 sends and receives electrical, electromagnetic or optical 
signals which carry digital data streams representing various types of information. 

[0051] Network link 921 typically provides data communication through one or more networks to other data devices. 

35 For example, network link 921 may provide a connection through local network 922 to host computer 923 or to data 
equipment operated by ISP 924. ISP 924 in turn provides data communication services through the world wide packet 
data communication network now commonly referred to as the "Internet" 925. Local network 922 and Internet 925 both 
use electrical, electromagnetic or optical signals which carry digital data streams. The signals through the various 
networks and the signals on network link 921 and through communication interface 920, which carry the digital data 

40 to and from computer 900, are exemplary forms of carrier waves transporting the information. 

[0052] Processor 913 may reside wholly on client computer 901 or wholly on server 926 or processor 91 3 may have 
its computational power distributed between computer 901 and server 926. Server 926 symbolically is represented in 
Figure 9 as one unit, but server 926 can also be distributed between multiple "tiers". In one embodiment, server 926 
comprises a middle and back tier where application logic executes in the middle tier and persistent data is obtained in 

is the back tier. In the case where processor 91 3 resides wholly on server 926, the results of the computations performed 
by processor 91 3 are transmitted to computer 901 via Internet 925, Internet Service Provider (ISP) 924, local network 
922 and communication interlace 920. In this way, computer 901 is able to display the results of the computation to a 
user in the form of output. 

[0053] Computer 901 includes a video memory 914, main memory 915 and mass storage 912, all coupled to bi- 
so directional system bus 918 along with keyboard 910, mouse 911 and processor 91 3. As with processor 913, in various 
computing environments, main memory 91 5 and mass storage 912, can reside wholly on server 926 or computer 901 , 
or they maybe distributed between the two. Examples of systems where processor 913, main memory 915, and mass 
storage 912 are distributed between computer 901 and server 926 include the thin-client computing architecture de- 
veloped by Sun Microsystems, Inc., the palm pilot computing device and other personal digital assistants, Internet 
55 ready cellular phones and other Internet computing devices, and in platform independent computing environments, 
such as those which utilize the Java technologies also developed by Sun Microsystems, Inc. XML DOM trees and 
identifiers for the nodes in the DOM trees may be stored in main memory 915 with a cache 990. Objects removed from 
the cache may be stored in an area 995 of mass storage 912. 
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[0054] The mass storage 912 may include both fixed B nd removable media, such as magnetic, optical or magnetic 
optical storage systems or any other available mass storage technology. Bus 91 B may contain, for example, thirty-two 
address lines for addressing video memory 914 ormain memory 915. The system bus 91 8 also includes, for example, 
a 32-bit data bus for transferring data between and among the components, such as processor 913, main memory 
915, video memory 914 and mass storage 912. Alternatively, multiplex data/address lines may be used instead of 
separate data and address lines. .... u 

[0055] In one embodiment of the invention, the processor 913 is a microprocessor manufactured by Motorola, such 
as the 680X0 processor or a microprocessor manufactured by Intel, such as the B0X86, or Pentium processor, or a 
SPARC microprocessor from Sun Microsystems, Inc. However, any other suitable microprocessor or microcomputer 
may be utilized Main memory 915 is comprised of dynamic random access memory (DRAM). Video memory 914 is a 
dual-ported video random access memory. One port of the video memory 914 is coupled to video amplifier 91-6. The 
video amplifier 916 is used to drive the cathode ray tube (CRT) raster monitor 917. Video amplifier 91 6 is well known 
in the art and may be implemented by any suitable apparatus. This circuitry converts pixel data stored in video memory 
914 to a raster signal suitable for use by monitor 91 7. Monitor 91 7 is a type of monitor suitable for displaying graphic 

[0056] S Computer 901 can send messages and receive data, including program code, through the network(s) , network 
link 921 and communication interface 920. In the Internet example, remote server computer 926 might transmit a 
requested code for an application program through Internet 925, ISP 924, local network 922 and communication in- 
terface 920. The received code may be executed by processor 913 as it is received, and/or stored in mass storage 
912 or other non-volatile storage for later execution. In this manner, computer 900 may obtain appl.cauon code in the 
form of a carrier wave. Alternatively, remote server computer 926 may execute applications using processor 913 : and 
utilize mass storage 912, and/or video memory 915. The results of the execution at server 926 are then transmitted 
through internet 925, ISP 924, local network 922 and communication interface 920. In this example, computer 901 
performs only input and output functions. 

[0057] Application code may be embodied in any form of computer program product. A computer program product 
comprises a medium configured to store or transport computer readable code, or in which computer readable code 
may be embedded. Some examples of computer program products are CD-ROM disks, ROM cards, floppy d.sks, 
magnetic tapes, computer hard drives, servers on a network, and carrier waves. 

[0058] The computer systems described above are for purposes of example only An embod.ment of the invention 
may be implemented in any type of computer system or programming or processing environment. 
[0059] Thus a garbage collector that uses an LRU algorithm to free memory from an XML DOM tree active in an 
application cache is described in conjunction with one or more specific embodiments. The invention is defined by the 
claims and their full scope of equivalents. 
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APPENDIX A 



Pseudo-code for an embodiment of an XML garbage collector 



Function FreeMemory 
Begin function 

while (memory < specified limit specified by administrator) 
for every pass in the node table 

/ /nodeld uniquely represents a node in memory 

int nodeld - search node table for least recently used node and return nodeld 
If (nodeld is not getting used by other users) 

Childrenfound - search for all other entries to find if any other children of the same 

node was opened 

if (childrenfound = false) 

remove node from cache and entry from node table 
call FreeMemory recursively 

} 

else 

^ //Lower the priority of the node wrt its children so that it is visited only 

after children are closed 

change the timestamp of this node to the most recently used time stamp of 

children + 1 millsecs 
call FreeMemory recursively 

} } 

else if (same nodeld has multiple entries in Node table) 

^ long timedur = find out the most recently used entry for the node for that specific 
user 

create a temporary data structure 

fill it with a list of all contendors with multiple and single entries from the remaining 
nodes 

If the most recently used entry (among all the entries in temp data structure) is the 
least recently used 

childrenfound - Search for all other entries to find if any other children of 

the same node was opened 
if(childrenfound — false) 

remove node from cache and all entries from node table 
call FreeMemory recursively 

} 

else 
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call FreeMemory recursively 



30 



} } 

} //End of for loop 
} // End of while loop 
End function 



Claims 

1 • A method for freeing memory from an XML DOM tree in a cache, said method comprising: 

DoZeT " " * M to each of one or more nodes of said XML 

scanning said node table to iocate a least recently used node using said identifiers: and 
tZ^reTo,r tifierS ' eaSt rSCent,y USed n ° de ' " Sa ' d XML DOM tree occupies more memory 

2. The method of claim 1 wherein one of said identifiers comprises a time stamp entry associated with each of said 

3. The method of at leas, one of the claims 1 and 2 wherein said scanning further comprises: 

examining said time stamp entry to find said least recently used node. 
* 4. The method of at least one of the preceding claims wherein said scanning further comprises: 

determining whether said least recently used node has a child node; 
^ modifying said time stamp associated with said least recently used node. 

5. The method of claim 4 wherein said modifying further comprises: 

changing said time stamp to a value of a most recently used child plus a millisecond. 
« 6. The method of at leas, one of the preceding claims wherein said scanning further comprises: 

determining whether a node is opened in multiple sessions by an identical user; 
o selecting a most recently used copy of said node; 

placing one or more second identifiers for said node in an intermediate data structure; 
choosing a least recently used node in said data structure using said second identifiers; 
removing said second identifiers from said node table. 
7- A program having instructions adapted to carry out the method of at least one of the claims 1-6. 
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8. A computer readable medium, in which a program is embodied, where the program is to make a computer execute 
the method of at least one of the claims 1 - 6. 

9. A computer program product comprising: 

5 

a computer usable medium having computer readable program code embodied therein configured to free 
memory from an XML DOM tree in a cache, said computer program product comprising: 

computer readable code configured to cause a computer to store one or more identifiers for each ol one 
w or more nodes in said XML DOM tree in a node table; 

computer readable code configured to cause a computer to scan said identifiers to locate a least recently 
used node; and 

15 computer readable code configured to cause a computer to remove said identifiers and said least recently 

used node, if said XML DOM tree occupies more memory then a threshold. 

10. The computer program product of claim 9 wherein one of said identifiers comprises a time stamp entry associated 
with each of said nodes. 

11. The computer program product of at least one of the claims 9 and 10 wherein said computer readable code con- 
figured to cause a computer to scan further comprises: 

computer readable code configured to cause a computer to examine said time stamp entry to find said least 
2s recently used node. 

12. The computer program product of at least one of the claims 9 to 1 1 wherein said computer readable code configured 
to cause a computer to scan further comprises: 

30 computer readable code configured to cause a computer to determine whether said least recently used node 

has a child node; 

computer readable code configured to cause a computer to modify said time stamp associated with said least 
recently used node. 

13. The computer program product of claim 1 2 wherein said computer readable code configured to cause a computer 
to modify further comprises: 

computer readable code configured to cause a computer to change said time stamp to a value of a most 
40 recently used child plus a millisecond. 

14. The computer program product of at least one of the claims 9 to 1 3 wherein said computer readable code configured 
to cause a computer to scan further comprises: 

as computer readable code configured to cause a computer to determine whether a node is opened in multiple 

sessions by an identical user; 

computer readable code configured to cause a computer to select a most recently used copy of said node; 

so computer readable code configured to cause a computer to place one or more second identifiers for said node 

in an intermediate data structure; 

computer readable code configured to cause a computer to choose a least recently used node in said data 
structure using said second identifiers; 

55 

computer readable code configured to cause a computer to remove said second identifiers from said node 
table. 
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15. A garbage collector for freeing memory from an XML DOM tree in a cache, said garbage collector comprising: 

a threshold for a maximum amount of memory that may reside in said cache; 

a node table configured to store one or more identifiers which correspond to each of one or more nodes of 
said XML DOM tree; 

a scanner for examining said identifiers to locate a least recently used node; and 

a garbage collector for removing said identifiers and said least recently used node, if said XML DOM tree 
occupies more memory then said threshold. 

16. The garbage collector of claim 15 wherein one of said identifiers is a time stamp entry associated with each of 
said nodes. 

17. The garbage collector of at least one of the claims 15 and 16 wherein said scanner further comprises: 

an examiner for looking at said time stamp entry to find said least recently used node. 

18. The garbage collector of at least one of the claims 15 to 1 7 wherein said step of scanning further comprises: 

a determiner for determining whether said least recently used node has a child node; 
a modifier for modifying said time stamp associated with said least recently used node. 

19. The garbage collector of claim 18 wherein said modifier further comprises: 

a second time stamp configured to be changed to a value of said time stamp for a most recently used child 
plus a millisecond. 

20. The garbage collector of at least one of the claims 15 to 19 wherein said step of scanning further comprises: 

a second determiner for determining whether a node is opened in multiple sessions by an identical user; 
a selector for obtaining a most recently used copy of said node; 

an intermediate data structure to place one or more second identifiers relating to said node; 

a chooser for choosing a least recently used node in said data structure using said second identifiers; and 

a remover to remove every instance of said'second identifiers in said node table. 
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